A new approach to instruction-idioms detection in a retargetable decompiler
نویسندگان
چکیده
Retargetable executable-code decompilation is a one of the most complicated reverse-engineering tasks. Among others, it involves de-optimization of compiler-optimized code. One type of such an optimization is usage of so-called instruction idioms. These idioms are used to produce faster or even smaller executable files. On the other hand, decompilation of instruction idioms without any advanced analysis produces almost unreadable high-level language code that may confuse the user of the decompiler. In this paper, we revisit and extend the previous approach of instruction-idioms detection used in a retargetable decompiler developed within the Lissom project. The previous approach was based on detection of instruction idioms in a very-early phase of decompilation (a front-end part) and it was inaccurate for architectures with a complex instruction set (e.g. Intel x86). The novel approach is based on delaying detection of idioms and reconstruction of code to the later phase (a middleend part). For this purpose, we use the LLVM optimizer and we implement this analysis as a new pass in this tool. According to experimental results, this new approach significantly outperforms the previous approach as well as the other commercial solutions.
منابع مشابه
On Anthropomorphism in Technology-Enhanced Language Learning: Does Modality Matter in Agent-Based Multimedia Instruction on L2 Idioms?
The present study aimed to satisfy a twofold purpose: On the one hand, it sought to verify the postulation that agent-based instruction could offer a compromise approach to teaching L2 idioms where form and meaning would be equally emphasized during instruction. Given that anthropomorphism has not been much under scrutiny, this research, on the other hand, sought to ascertain whether learning a...
متن کاملPreprocessing of Binary Executable Files Towards Retargetable Decompilation
The goal of retargetable machine-code decompilation is to analyze and reversely translate platform-dependent executable files into a high level language (HLL) representation. This process can be used for many different purposes, such as legacy code reengineering, malware analysis, etc. Retargetable decompilation is a complex task that must deal with a lot of different platform-specific features...
متن کاملThe Comparative Effect of Using Idioms in Conversation and Paragraph Writing on EFL Learners’ Idiom Learning
This study investigated the comparative effect of teaching idiomatic expressions through practicing them in conversation and paragraph writing on intermediate EFL learners’ idiom learning. The participants were sorted out of a population of 134 intermediate students in Zabansara Language School in Khorramabad based on their scores on a Preliminary English Test (PET) and an idiom test piloted in...
متن کاملDesign of an Automatically Generated Retargetable Decompiler
This paper presents a concept of a retargetable reverse compiler (i.e. a decompiler). This tool translates platform-specific binary applications into a high-level language (HLL) representation. A Python-like language was chosen as the target language. Our unique solution is automatically generated from the target platform description in the architecture description language (ADL) ISAC. The deco...
متن کاملIdiom Analysis: a Novel Approach to Dataflow Redundancy Identification
Computer architects have exploited properties of repeating patterns of computation in many applications ranging from instruction set architecture specification to cluster scheduling. Most existing techniques for detecting such patterns involve either identifying loops with high trip counts, or considering only adjacent instructions. Heretofore the general problem of exhaustively detecting patte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Comput. Sci. Inf. Syst.
دوره 11 شماره
صفحات -
تاریخ انتشار 2014